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Abstract We select the Luminous Infrared Galaxies by cross-correlating the Faint 
Source Catalogue (FSC) and Point Source Catalogue (PSC) of the IRAS Survey 
with the Second Data Release of the SDSS for studying their infrared and optical 
properties. The total number of our sample is 1267 for FSC and 427 for PSC by 
using 2(7 significance level cross-section. The "likelihood ratio" method is used to 
estimate the sample's reliability and for a more reliable subsample (908 for FSC 
and 356 for PSC) selection. Then a Catalog with both the infrared, optical and 
radio informations is presented and will be used in further works. Some statistical 
results show that the Luminous Infrared Galaxies are quite different from the 
Ultra-Luminous Infrared Galaxies. The AGN fractions of galaxies with different 
infrared luminosities and the radio to infrared correlations are consist with previous 
studies. 
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1 INTRODUCTION 

The research of Luminous Infrared Galaxies (LIGs, the galaxies with infrared luminosity 
(Lir, 8-1000 /xm) higher than 10 11 L Q ) began after the success of the first mid- to far-infrared 
all-sky survey carried out in 1983 by the Infra-Red Astronomical Satellite (IRAS). The physical 
properties of the LIGs, especially the Ultra-Luminous Infrared Galaxies (ULIGs, Lir, > 10 12 L©) 
were studied by using the IRAS infrared data and the follow-up optical (POSS, DSS, HST, VLT 
...) observations, such as the analyses of the Bright Galaxy Sample (BGS, Soifer et al. 1987b), 
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the optical spectroscopy of LIGs (Kim et al. 1995; Veilleux et al. 1995), the statistical study of 
the spectra of very luminous IRAS galaxies (Wu et al. 1998ab), the IRAS 1 Jy Survey of ULIGs 
(Kim et al. 1998ab) and the Point Source Catalog redshift survey (PSCz, Saunders et al. 2000). 
From the previous studies people found that most of the ULIGs arc in an interaction/merger 
system (Zou et al. 1991; Sanders et al. 1988; Kim et al. 1995; Lawrence et al. 1989) and with 
a high AGN fraction (Kim et al. 1995,2002; Wu et al. 1998ab). There is a possible evolution 
path (Sanders et al. 1988; Sanders & Mirabel 1996) from galaxy mergers to quasi-stellar objects 
(QSOs) and elliptical galaxies, which supports the hierarchical galaxy formation theory (Cole 
ct al. 2000). The LIGs with L IR - 10 11 - 10 12 L Q are quite different from the ULIGs in their 
morphologies and spectral features. The recent studies of the distant LIGs (0.4<z<1.2, Zheng et 
al. 2004) showed that there are many massive disks which have been forming a large fraction of 
their stellar mass since z = 1, and most of their central parts were formed prior to the formation 
of their disks. Although the LIGs are so important for studying, there has not been a large and 
reliable sample of LIGs for statistical analyses, so lots of physical properties the LIGs are still 
unclear. The role of LIGs and ULIGs in the formation and evolution of the galaxies is still a 
problem to be resolved. 

In order to study the properties of the LIGs in more detail, we need a large sample which 
has both the infrared and optical informations for our analyses. The Sloan Digital Sky Survey 
(SDSS) was chosen for the cross-correlation with IRAS data because of its large sky coverage 
(^2627 deg 2 for spectroscopic targets of the second data release) and high spectral signal-to- 
noise (S/N) ratio and spectral resolution (R ~ 1800). Although some authors have studied 
the optical properties for IRAS galaxies using the SDSS data (Goto 2005b; Pasquali et al. 
2005), their cross-correlation between optical and infrared catalogs is relatively simple (only 
use a fixed circle) for a reliable sample selection and they didn't present a complete catalog for 
further analyses. The structure of this paper is as follows: In Sect. 2 we give a simple description 
of the data and the cross-correlation between IRAS and SDSS; In Sect. 3 we use the "likelihood 
ratio" method for detailed identifications for our sample and estimate its reliability; In Sect. 4 
we describe our Catalog; In Sect. 5 we do some statistical works based on a selected subsample. 
Finally the summary is given in Sect. 6. We adopt cosmological parameters Hq=70 kms _1 Mpc _1 , 
O m =0.3, fl\=0.7 throughout this paper. 

2 DATA DESCRIPTION AND SAMPLE SELECTION 

2.1 IRAS Faint Source Catalog and Point Source Catalog 

The Infra-Red Astronomical Satellite (IRAS) was launched in 1983 (Neugebauer et al. 1984; 
Soifer et al. 1987a) and scanned almost all the sky in mid- and far-infrared (12, 25, 60, 100 //m) 
wavebands. The Faint Source Catalog (FSC, |b| > 10, Version 2.0, Moshir+ 1989) was released 
after the Point Source Catalog (PSC, Version 2.0, IPAC 1986). It contains data for 173044 
point sources in unconfused regions with flux densities typically above 0.2 Jy at 12, 25 and 60 
yitm, and above 1.0 Jy at 100 /xm, achieves roughly one-magnitude deeper in sensitivity relative 
to the PSC. The catalogues (both the FSC and PSC) give the IRAS sources' four band flux 
densities and qualities, the positions of the sources, and other useful parameters. The sources 
in the catalogues all have large positional uncertainties which can be described as an "error 
ellipse". The error ellipse stands for the uncertainties along (in-scan) and cross (cross-scan) 
the IRAS's scan direction, and the uncertainty ellipse major axis, minor axis and positional 
angle in the catalogues are used for describing it. The FSC is deeper than PSC but may be 
contaminated by foreground and background sources, the PSC is shallower but can be used for 
a comparison with previous results (e.g., the PSCz). Therefore, we use them separately to make 
up our sample and do statistical analyses based on each of them. 
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2.2 SDSS-DR2 Data 

The Sloan Digital Sky Survey (SDSS, York et al. 2000) contains an imaging survey of northern 
sky in the five bands u, g, r, i, z and a spectroscopic target survey performed by multi fibers. The 
Second Data Release (DR2, Abazajian et al. 2004, Version v2_20040928_1505) was released in 
2004. The SDSS-DR2 spectroscopic target survey covers about 2627 deg 2 of the sky, including 
about 260490 galaxies, 32241 quasars, 3791 high-z (z > 2.3) quasars and others objects. For the 
study of the detailed spectral properties of LIGs (such as their emission lines), we only choose 
the SDSS-DR2 spectroscopic targets with the redshift greater than 0.001 (to reject stars) and 
high redshift confidence (zConf > 0.9) to do the cross-correlation. Finally we obtain 268202 
sources from SDSS datasets as our candidates for the cross-correlation with IRAS catalogues. 

2.3 Cross- Correlation between the IRAS and SDSS 

We use the IRAS (FSC and PSC, separately) error ellipse as the cross-section (the SDSS's 
position uncertainties are neglected compared with the IRAS's) to do cross-correlation with the 
SDSS sources spectral positions. Two RMS uncertainty (2a) significance level was chosen for a 
high level confidence and more complete sample selection. The SDSS spectral redshift and the 
IRAS flux densities were then used to calculate the infrared luminosity (Lir) of the matched 
sources. Due to the fact that the 12^m and 25^m flux densities of the objects are mostly 
the "upper limit" (flux quality = 1), we calculate the far-infrared luminosity (Helou et al. 
1988; Sanders & Mirabel 1996) and then convert it to the total infrared luminosity (1-1000/iin, 
Calzetti et al. 2000) 1 : 



where fgo, fioo are the IRAS Rux densities in Jy at 60 and 100 /*m respectively. Then the LIGs 
(Lir > 10 11 L Q ) were chosen as our sample objects and the number of sources is 1267 for 
FSC and 427 for PSC 2 . From this sample we present a Catalog (will be described in Sect. 4) 
and perform detailed identifications and further analyses. Fig. 1 is the sky coverage of our 
sample (both FSC and PSC) in equatorial coordinates, which shows that it covers nearly all 
the SDSS-DR2 spectroscopic survey regions. 

2.4 VLA-FIRST Data 

The NRAO Very Large Array (VLA) Faint Images of the Radio Sky at Twenty-centimeters 
(FIRST) data (Becker et al. 1995) are used here for studying the radio properties of our sam- 
ple. The FIRST survey is a project designed to produce the radio equivalent of the Palomar 
Observatory Sky Survey (POSS) over 10 4 deg 2 of the North and South Galactic Caps. The 
FIRST Survey Catalog (White et al. 1997, from the 1993 through 2002, contains ~ 811000 
sources and covers ~ 9030 deg 2 ) including peak and integrated flux densities and size informa- 
tion is generated from the coadded images. The individual sources have 90% confidence error 

1 The contribution to the total infrared luminosity from the 1-8/nm regime is expected to be of the 
order of a few percent (Calzetti et al. 2000). 

2 Note that the objects with 60/im flux quality =1 have been rejected. We didn't treat the objects 
with 100/xm upper limit because it doesn't affect much on the calculation of Lir (see Sect 5.2 for 
details) . 



.Ffir = 1.26 x 10- 14 {2.58/ 60 + fwa}[Wm- 2 } 

Lfir = 4nDlF Fm [L Q ] 
L m (l - 1000/im) = 1.75L F ir 



(1) 



(2) 
(3) 
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Fig. 1 Distribution on the sky of the objects in our sample. This is an Aitoff projection 
in equatorial coordinates. Left: The FSC sample; Right: The PSC sample. 



circles of radius < 0.5" at the 3mJy level and 1" at the survey threshold (~ lmJy). The survey 
area has been chosen to coincide with that of the SDSS First Data Release (DR1) and ~ 50% of 
the optical counterparts to FIRST sources will be detected. We use the FIRST Survey Catalog 
updated at 2003 April 11 to perform the cross-correlation with the objects in our sample. 

We match our sample's SDSS spectral positions with the VLA FIRST positions using a 
2" searching radius and find that there are 624 objects for FSC and 258 for PSC which are 
contained in the FIRST catalog. This result means that the radio flux densities of these sources 
are all above the FIRST'S threshold (about lmJy). Thus they have a higher probability to be 
true IR sources because of the (far-) infrared to radio correlation (will be discussed in Sect. 5, 
Helou ct al. 1985,1993; Condon 1992; Ivezic 2002). 



2.5 Reliability and Completeness 

Due to our large 2a cross-sections for the cross-correlation, there are also some SDSS 
objects which are not really the IR sources being selected as our sample objects because of the 
contamination of foreground and/or background sources. So we calculate the random probability 
that the SDSS-DR2 spectroscopic targets fall into the IRAS 2a error ellipse by assuming that 
the SDSS targets arc uniformly distributed across the 2627 deg 2 sky and the mean IRAS 2a 
error ellipse area is about 0.56 arcmin 2 for the LIGs. The random probability is about 4.32% 
for FSC sample and 5.02% for PSC and hence our whole sample's reliability is about 95.68% 
(FSC) and 94.98% (PSC) (R = 1 - N random /N rca i). 

The completeness of our sample can be estimated from the 2a error ellipse cross-section, 
the incompleteness introduced by this term alone is about 10% assuming Gaussian distribution. 
And it also may be affected by several factors: 

1. We only select the SDSS targets with high confidence rcdshift (zConf > 0.9) as our can- 
didates, which will lead the probability that the targets without high quality redshift estimates 
will be rejected. The incompleteness increases from 1% for the bright objects to 6% in the faint 
end. 

2. Because of the target magnitude limit of the SDSS spectroscopic survey (Petrosian mag_r 
< 17.77 for main galaxies and PSF mag_i < 19.1 for quasars), there are also some optically 
faint LIGs which could not be included in the SDSS spectroscopic survey. So they are missed 
mainly due to their relatively higher redshift or serious obscuration by dust. 
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3. There are also missing galaxies due to lack of fibers in dense regions, spectroscopic failures, 
and fiber collisions, which can be defined as the sampling rate: f t ~ 0.92 in average. (Blanton 
et al. 2001) 



3 "LIKELIHOOD RATIO" METHOD 

It is not easy to determine whether the matched SDSS targets are really the infrared objects or 
not. So we use the "Likelihood Ratio" (LR) method (Sutherland & Saunders 1992) to calculate 
the probability of the "true" cross-correlation for each matched SDSS object. 

The likelihood ratio method is defined as that the cross-correlation probability between two 
observed sources are (assume that the errors are Gaussian in common) 3 : 

LR = Q(< mi )exp(-r 2 /2) 
2ira & a\,n(< m;) 

In this formula, r is the " normalized distance" : 

r2= (al-af + (61-62)3 

(al, bl) and (a2, b2) are the positions of each source, a terms standard deviations and n(< m ; ) 
is the local surface density of objects (galaxies) brighter than the candidate. The Q(< m;) is the 
multiplicative factor in the numerator which represents a priori probability that a "true" optical 
counterpart brighter than the flux limit exists amongst the identifications, and for simplicity 
we set Q = 1 in this work. 

For our sample, the SDSS position uncertainties can be neglected compared with the IRAS , s 
large error ellipse. In this work, we refer to the IRAS uncertainty ellipse major axis (UncMaj) 
as cr a , minor axis (UncMin) as Ob and the position of the SDSS object in the IRAS 2a error 
ellipse (in the unit of a, from to 2) as r. We use the SDSS photometric targets to get n(< m ; ): 

n{< m; = (6) 

47T(7 a (7b 

N(< mi) stands for the number of galaxies with r band magnitude less than or equal to the 
candidate's in the corresponding IRAS 2a error ellipse. Then we can get the LR formula for 
our sample: 

N(< m ; ) v ; 

We calculate all of our samples' likelihood ratio values by using the SDSS photometric data 
(r band Petrosian magnitude for galaxies and i band PSF magnitude for QSOs). Then a random 
sample is selected for estimating the reliability of each object (use the method developed by 
Lonsdale et al. 1998; Rutledge et al. 2000; Masci et al. 2001), which is used to assess the cross- 
correlation probability and select a more reliable subsample. We also calculate the LRs and 
reliabilities for the PSCz sample (Saunders et al. 2000, all these optical targets selected from 
the PSC are identified as "true" IR objects) overlapped with our PSC sample for a comparison. 
The reliability distributions of the FSC, PSC and PSCz sample are shown in Fig. 2. 



Note that the cross-scan errors for faint galaxies of IRAS are less Gaussian (IRAS Explanatory 
Supplement VII. Analysis of Processing C. Positional Accuracy), but this doesn't affect much on our 
statistical results of this work. So we use the Gaussian assumption and the Likelihood ratio method 
here and will try to improve it in further works. 
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Fig. 2 This figure shows the distributions of our sample's reliabilities. The solid line 
is the FSC sample's, the dashed line is the PSC sample's, and the dotted line is the 
PSCz sample's. 



4 THE CATALOG 

We present a Catalog (in ascii table) for the our sample of the LIGs, which contains the 
IRAS, SDSS-DR2 and FIRST informations. The structure and content of our Catalog 4 are as 
follows: 

The IRAS data (f(p)sciras.cat): the IRAS (FSC and PSC) name; IRAS RA and DEC; the 
error ellipse major (UncMaj), minor axis (UncMin) 5 and position angle; 12, 25, 60 and 100 
/im flux densities and qualities; and the calculated infrared luminosity using the SDSS spectral 
redshift. 

The SDSS-DR2 photometric data (f(p)scsdssphotol(2).cat): the SDSS ObjID; Photometric 
RA and DEC; objType and probPSF; SDSS five bands modelMag, psfMag, fiberMag, petroMag 
and their errors; Galactic extinctions; petroR50 and pctroR90 for band r. 

The SDSS-DR2 spectroscopic data (f(p)scsdssspecl.cat): the SDSS SpecObjID; 
Spectroscopic RA and DEC; spectral redshift and its error; eclass and eCoeff; zWarning and 
zStatus; SpecClass, mjd, plate, fibcrlD. 

The SDSS-DR2 emission line data (f(p)scsdssspec2.cat, from MPA-SDSS: www.mpa- 
garching.mpg.de/SDSS, Trcmonti et al. 2004, Version 5.0_4): the Ha, H/3, [OII]AA3727,3729, 



4 The Catalog will be put on the web, use this URL: 

5 Note that the UncMaj and UncMin in the PSC stand for 1.96a significance level. 
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[OIII]A5007, [NII]A6584, [SII]AA6716,6731 and [OI]A6300 emission line's fluxes and flux errors; 
the corresponding Equivalent Widths (EQWs) and errors. Based on these data, we classify 
our sample into several spectral types: a) The galaxies without apparent emission lines (NoE 
for short) arc chosen by the criterion: Ha EQW > -5A. 6 ; b) The QSOs/Seyfert Is (SI) are 
those with Broad Line Regions (BLRs) and are also classified as QSOs by SDSS pipeline (spec- 
Class = 3); c) The classification of narrow emission line galaxies (Seyfert 2s, LINERs and HII 
regions) are performed using the emission line fluxes ratios, methods and the considered line ra- 
tios arc: [OIII]A5007/H/3, [NII]A6584/Ha, [SII](A6716+A6731)/Ha, [OI]A6300/Ha (Osterbrock 
1985,1989; Wu et al. 1998b; Kauffmann et al. 2003c; Kewley et al. 2001). Specifically, for Seyfert 
2s (S2): [OIII]/H/3 > 3; For LINERs (L): [NII]/Ha > 0.6, [SII]/Ha > 0.4, [OI]/Ha > 0.05 and 
[OIII]/H/3 < 3; For HII galaxies (H): [Nil] /Ha < 0.6, [SII]/Ha < 0.4, [OI] /Ha < 0.05 and 
[OIII]/H/3 < 3; The mixture types (LH: Mixture of LINERs and HIIs) are those which locate at 
the border of different spectral populations. The mixture type galaxies could be a transitional 
phase from HII galaxies to AGNs (Wu et al. 1998b). And there are also some galaxies which 
are not in the MPA's emission line catalog, so we classify them as Unknown (?). We will discuss 
this classification in detail in Sect. 5.3. 

The VLA FIRST radio data (f(p)scfirst.cat): The VLA FIRST data (described in Sect. 
2.4) contains: the FIRST name; FIRST RA and DEC; peak and integrated flux densities at 
1.4GHz; the local noise estimate; major and minor axis (FWHM), position angle; fitted MajAxis, 
MinAxis and PA before deconvolution; name of the coadded image containing the source; and 
based on the cross-correlation we give a "flags" for our sample: stands for the case that the 
SDSS object is correlated with a FIRST source within 2" and 1 stands for that there arc no 
FIRST counterparts in the corresponding search radius. 

We give each source a new index number for each FSC and PSC sample, and will do further 
works based on it. 

The main catalog (f(p)sc_main.cat) contains only the most important informations we need, 
includes: the source number, the likelihood ratio (LR) and the Reliability we calculated in 
Sect. 3, the IRAS name, the infrared luminosity, redshift, SpecObjID, Spectroscopic RA and 
DEC, SpecClass, ObjID, modelMag_r, extinctions, petroMag_r, the FIRST flag, the SDSS 
object's position in the IRAS error ellipse (in the unit of a), the spectral types and the sign of 
the same sources across the two (FSC and PSC) sample. 



5 ANALYSES AND RESULTS 

5.1 Subsample Selection 

For the purpose of high confidence analyses we need a subsample with relatively high 
reliabilities for further works. From the comparison between our sample and the random sample 
(discussed in Sect. 3 and shown in Fig. 2), here we give a selective criterion as the Reliability > 
0.98 for a relatively high cross-correlation probability. We choose this criterion for the subsample 
selection, and it contains 908 objects for FSC and 356 for PSC. From the comparison of the two 
redshifts (derived from our PSC sample, PSC subsample and the PSCz sample) of the same 
IRAS source (Fig. 3), we find that our subsample (at least the PSC) is more reliable because 
the sources' redshifts are consistent through the two sample except for only two sources. We 
also estimate our subsample's completeness from the LR distribution of the PSCz sample and 
find that it is about 86.69% if use the same selective criterion. 



Absorption lines have a positive sign. 
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Fig. 3 This figure shows the comparison between our PSC sample's and the PSCz 
sample's rcdshifts. (a):The PSC whole sample vs. PSCz; (b): The PSC subsample vs. 
PSCz. 
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Fig. 4 This figure shows the redshift distribution of our LIGs subsample. (a): The 
FSC subsample; (b): The PSC subsample. The solid lines are for the whole subsample, 
the dashed lines are for the LIGs (Lir ~ 10 11 - 10 12 L Q ), and the dotted lines are for 
the ULIGs (Lir > 10 12 L Q ). 



5.2 Basic Statistical Properties 

The redshift and the Lir distribution of our subsample are shown in Fig. 4 and Fig. 5. The 
number of LIGs (N L i Gs , which L IR - 10 11 - 10 12 L ) is 873 for FSC and 334 for PSC, and 
Zmcdian ~ 0.08 (FSC) and 0.05 (PSC). For the ULIGs (which L IR > 10 12 L Q ), N UL i Gs is 35 
(FSC) and 22 (PSC), and z mcdian ~ 0.18 (FSC) and 0.17 (PSC), - 0.1 higher than the LIGs. 
The ratio NuliGs:Nligs is 0.04 for FSC and a higher value 0.07 for PSC. For a comparison of 
the infrared luminosities derived from FSC and PSC (see Fig. 6), we find that the Lir derived 
from FSC is consist with that from PSC by using the formula given in Sect 2.3. 

The color (u-r) distributions of our subsample are shown in Fig. 7. Compared with the 
color separation of galaxy types described by Strateva et al. (2001), our result shows higher u-r 
values. The serious dust extinction of the LIGs, especially the ULIGs may be responsible for 
the redder color of our subsample. 
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Fig. 5 This figure shows the infrared lumi- 
nosity distribution of our LIGs subsample. The 
solid line is for the FSC, and the dashed line 
is for the PSC. 
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Fig. 6 This figure shows a comparison be- 
tween the infrared luminosities derived from 
the FSC and the PSC subsample, all with 
60/im flux qualities = 2 or 3. 
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Fig. 7 This figure shows the color (u-r) distribution of our LIGs subsample. (a): The 
FSC subsample; (b): The PSC subsample. The solid lines are for the LIGs (Ltr ~ 
10 11 - 10 12 L ), and the dotted lines arc for the ULIGs (L m > 10 12 L Q ). 



5.3 AGN Fraction 

Throughout this paper, we term AGNs as the assembly of the Seyfert Is, Seyfert 2s, LINERs, 
and the Mixture types (S1+S2+L+LH, the spectral types are described in Sect. 4). The BPT 
(Baldwin et al. 1981) diagrams for classifying the narrow emission line galaxies (Seyfert 2s (S2), 
LINERs (L), HIIs (H) and the Mixture types) are shown in Fig. 8. The number and fractions of 
each type are listed in Tables 1,2 and the distribution versus L IR , of our subsample is shown in 
Fig. 9 (the galaxies classified as Unknown(?) have been removed). Note that we have performed 
a volume correction by giving each objects a weight equal to the inverse of its maximum visibility 
volume: 1/Vmax (Schmidt 1968; Kauffmann et al. 2003ab), with a magnitude and flux cutoff 
for correcting the selection biases. We calculate the Vmax as follows: 

; n i \ magum - mag 

logDi(max) SBSS = h logD\(z) (8) 

5 

A(maaOiRAS = AW(-^-) 1/a (9) 
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Fig. 8 These figures show the BPT diagrams of our LIGs subsample. The straight 
lines are the criterion we used in this paper to classify the Seyfert 2s (S2) , LINERs 
(L) and HIIs (H). The solid curves are the criterion given by Kewley et al. 2001 
for separating the starbursts and AGNs, and the dashed curves in figure (a) are the 
criterion given by Kauffmann et al. 2003c. 



In this equation magi; m is the SDSS magnitude cutoff (Petrosian mag_r — 17.5), and f60i; m is 
the IRAS 60/zm flux cutoff (0.3Jy for FSC, 0.6Jy for PSC). Then the Di(max) for our estimation 
is the minimum of Di(max)sDSS anc l Di(max)iRAs, so: Vmax = 4/37rDi 3 (max). 

The AGN fractions of our subsample increase with the infrared luminosities, from ~45% to 
80% when Lir increases from 10 11 to 10 13 L©. This is in agreement with the previous results 
that the AGN fraction increases from the LIGs to ULIGs, from 47% to 70-75% (Kim et al. 
1995; Veilleux et al. 1995,1999) and 56% to 82% (Wu et al. 1998b). From Tables 3,4 we also 
find that some galaxies without apparent emission lines (NoE) have high Ljr, especially for 
PSC subsample (due to their relative higher Lir). These galaxies may be either: a) Have low 
S/N ratios or bad spectra; b) One member of a galaxy pair or group, and the large amount of 
infrared emissions may come from its companions; c) Have late stage merger feature and e(a) 
spectral feature (Poggianti & Wu 2000) or E+A feature, which indicates a post-starburst phase 
(Zabludoff et al. 1996; Yang et al. 2004; Goto 2005a). 
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Table 1 The spectral type distribution with the infrared luminosity of FSC subsam- 
ple, the errors for AGN fractions are based on Poisson statistics. 



Spectral Type 


logL IR (L ) - 11.0-11.5 


11.5-12.0 


> 12.0 


Sl a 


0.94%(6) b 


0.75%(2) 


7.87%(4) 


S2 


5.22%(22) 


3.89%(5) 


0.00% (0) 


L 


9.05%(27) 


8.41%(10) 


5.53%(1) 


LH 


30.49%(100) 


36.52%(44) 


64.71%(5) 


H 


49.98%(183) 


47. 70% (58) 


21.88%(1) 


NoE 


4.33%(13) 


2.73%(5) 


0.00%(0) 


Total 


351 


124 


11 


AGN 


45.69±3.67%(155) 


49.57±6.35%(61) 


78.12±24.70%(10) 



a The spectral types: SI, S2, L, LH, H and NoE stand for the Seyfert Is, Seyfert 2s, LINERs, Mixture 

types and HIIs as described in Sect. 4. 
b The volume corrected fraction of different spectral types in each Lir bin, the number of each type 

galaxies is in the bracket. 



Table 2 The spectral type distribution with the infrared luminosity of PSC subsam- 
ple. 



Spectral Type 


logLi R (L ) ~ 11.0-11.5 


11.5-12.0 


> 12.0 


SI 


1.59%(3) 


2.59%(2) 


3.69%(3) 


S2 


6.31%(7) 


6.43%(3) 


0.00%(0) 


L 


7.34%(9) 


4.74%(2) 


0.00%(0) 


LH 


37.53%(47) 


33.83%(22) 


74.94% (6) 


H 


45.53%(70) 


50.06%(30) 


21.37%(1) 


NoE 


1.69%(4) 


2.34%(2) 


0.00%(0) 


Total 


140 


61 


10 


AGN 


52.78±6.50%(66) 


47.60±8.84%(29) 


78.63±26.21%(9) 



FSC_SUB 




11. 0-1'. 5 11.5-12.0 >12.0 11.0-1'.5 11.5-12.0 >12.0 



log(L, R /L s „„) log(Li R /L sun ) 

Fig. 9 This figure shows the spectral type (Seyfert Is (SI), Seyfert 2s (S2), LINERs 
(L), Mixture types (Mix), HIIs (H) and No apparent Emission lines (NoE), colored 
from black to white) distribution vs. the infrared luminosity of our subsample. Left: 
The FSC subsample; Right: The PSC subsample. 
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Fig. 10 This figure shows the Infrared (60/xm) to Radio (1.4GHz) correlation of the 
subsample. (a): The FSC subsample; (b): The PSC subsample. The straight line is 
the best fit obtained by Yun et al. (2001) for an all-sky sample of infrared detected 
galaxies from IRAS. 



5.4 Infrared to Radio Correlation 

The infrared to radio correlation of our subsample is shown in Fig. 10, we calculate the 
L60/xm and Li. 4 gh z using the formula (Yun et al. 2001): 

logL 60t , m (L Q ) = 6.014 + 2logD + logS^™ (10) 

logL^G^WHz- 1 ) = 20.08 + 2logD + logS 1AGHz (11) 

where D is the luminosity distance in Mpc and Sgo^m and Si.4gh z arc flux densities in units of 
Jy. The straight line is the best fitting line obtained by Yun et al. (2001) for an all-sky sample 
of infrared detected galaxies from IRAS: 

logL 1AGHz = (0.99 ± O.Ol)ML 6OMm /L ) + (12.07 ± 0.08) (12) 

From these relations we find that the infrared to radio correlation for our subsample are follow 
the correlations for an all-sky sample of infrared detected galaxies from IRAS (Yun et al 2001). 
The slight deviation for the PSCJ3UB is not significant and smaller than the scattering of the 
infrared to radio correlation. 

The q parameter is also plotted for our subsample in Fig. 11, following the formula (Condon 
et al. 1991): 

M2.58S l 60/im + SlOO^m . ,S\.4GUz^ M „\ 
2MTy ~~ Jy (13) 

The solid line is at q = 2.34 which is the mean value obtained by Yun et al. (2001), the top 
and bottom dotted lines are limits for three times FIR excess and radio excess from the mean 
respectively. The radio excess objects are mainly Radio Loud (RL) AGNs (Roy & Norris 1997) 
that may have some complex mechanisms of energy generation (e.g. the jet emission). 



6 SUMMARY 

In this paper we select a sample of Luminous Infrared Galaxies based on the cross-correlation 
between the IRAS FSC and PSC data and the SDSS-DR2, and present a Catalog. We use 
the "likelihood ratio" method to estimate the sample's reliability and for a high confidence 



(a) 
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FSC_SUB PSC_SUB 



■3* 
i . - -' . 



10 11 12 

l°g(L B0(1 /L su „) 



(b) 



'jfe. ■■■■ 



10 11 12 



13 



Fig. 11 This figure shows the q parameter for the subsample. (a): The FSC subsamplc; 
(b): The PSC subsample. The solid line is at q = 2.34 which is the mean value obtained 
by Yun et al. (2001), the top and bottom dotted lines are limits for three times FIR 
excess and radio excess from the mean respectively. 



subsample selection. Although the LR method also has some problems and needs to be improved, 
it seems that it can be used as a stable and creditable sample selection method based on the 
analyses and comparison in this work. From the statistical analyses (e.g., the redshift, Lir 
and color distributions, the spectral types, and the radio to infrared correlations) we find that 
the LIGs and ULIGs are quite different. We will perform further analyses in the future and 
attempt to know more about the LIGs, such as their morphologies and environments (Wang et 
al. in preparation), the origins of the IR excess (Pasquali et al. 2005) and their star formation 
histories. Some interesting subsamples like the IR QSOs (Zheng et al. 2002; Hao et al. 2005) and 
RL AGNs (Best et al. 2005) will also be selected and analyzed for understanding the connections 
between the star formation and AGN activity. During such works we will keep on finding better 
statistical methods for huge astronomical data mining and analyses. 
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